Improving Degraded Ancient Document Images Using Phase-based Binarization Model
Abstract
We present a phase-based binarization model for ancient document images, together with a post-processing method that can improve any binarization method and a ground truth generation tool. Three feature maps derived from the phase information of the input document image form the core of the binarization model: the maximum moment of phase congruency covariance, a locally weighted mean phase angle, and a phase-preserved denoised image. The proposed model consists of three standard steps: 1) preprocessing; 2) main binarization; and 3) post-processing. The preprocessing and main binarization steps rely mainly on phase-derived features, while the post-processing step uses specialized adaptive Gaussian and median filters. One of the outputs of the binarization step, which shows high recall, is used in the proposed post-processing method to improve the performance of other binarization methods. Finally, we develop a ground truth generation tool, called Phase ground truth, to simplify and speed up ground truth generation for ancient document images. Comprehensive experimental results on the DIBCO’09, H-DIBCO’10, DIBCO’11, H-DIBCO’12, DIBCO’13, PHIBD’12, and BICKLEY DIARY data sets show the robustness of the proposed binarization method across various types of degradation and document images.
Keywords: Binarization Model; Phase-Derived Features; Phase Ground Truthing; Document Enhancement.
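The three-step structure described in the abstract (preprocessing, main binarization, post-processing) can be illustrated with a minimal sketch. This is not the phase-based method itself: as labeled assumptions, a 3x3 mean filter stands in for the phase-preserved denoising, Otsu's global threshold stands in for the phase-derived binarization, and a plain 3x3 median filter stands in for the specialized adaptive filters. All function names are hypothetical.

```python
import numpy as np

def _windows3x3(img):
    """Stack the nine 3x3-neighbourhood shifts of img (edges replicated)."""
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    return np.stack([padded[r:r + h, c:c + w]
                     for r in range(3) for c in range(3)])

def preprocess(gray):
    # Assumption: a 3x3 mean filter replaces the phase-preserved denoising.
    return _windows3x3(gray.astype(np.float64)).mean(axis=0)

def otsu_threshold(gray):
    # Assumption: Otsu's global threshold replaces the phase-derived step.
    hist, edges = np.histogram(gray, bins=256, range=(0.0, 256.0))
    total = hist.sum()
    centers = (edges[:-1] + edges[1:]) / 2
    w0 = np.cumsum(hist) / total            # weight of the darker class
    mu = np.cumsum(hist * centers) / total  # cumulative mean
    w1 = 1.0 - w0                           # weight of the brighter class
    valid = (w0 > 0) & (w1 > 0)
    between = np.zeros_like(w0)
    between[valid] = (mu[-1] * w0[valid] - mu[valid]) ** 2 / (w0[valid] * w1[valid])
    k = int(np.argmax(between))
    return edges[k + 1]  # upper edge of the last bin in the darker class

def postprocess(binary):
    # Assumption: a plain 3x3 median filter replaces the adaptive filters.
    # It removes isolated foreground pixels and fills isolated holes.
    return np.median(_windows3x3(binary.astype(np.uint8)), axis=0) > 0

def binarize(gray):
    """Return a boolean map that is True where a pixel is classified as text."""
    smooth = preprocess(gray)
    text = smooth < otsu_threshold(smooth)  # dark pixels are foreground
    return postprocess(text)

# Usage on a synthetic page: light background with one dark stroke.
page = np.full((32, 32), 200, dtype=np.uint8)
page[8:13, 4:29] = 40
mask = binarize(page)
```

Keeping the three stages as separate functions mirrors the abstract's description and makes it possible to swap any stand-in for a phase-derived counterpart without touching the other stages.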
Related resources
Ancient Document Images Enhancement Using Phase Based Binarization
In this paper, we present a phase-based binarization model for degraded document images, as well as a post-processing method that can improve any binarization method and a ground truth generation tool. Many binarization techniques have been proposed in the literature for different types of binarization problems. They include an adaptive image contrast based document image binarization technique t...
Evaluation of Ancient Documents and Images by using Phase Based Binarization
Segmentation of text from badly degraded document images is a very challenging task due to the high inter/intra-variation between the document background and the foreground text of different document images. These features are the maximum moment of phase congruency covariance, a locally weighted mean phase angle, and a phase preserved denoised image. The proposed model consists of three standa...
Binarization Of Ancient Document Images
Ancient documents accumulate a significant amount of human heritage over time. However, many environmental factors, improper handling, and the poor quality of the materials used in their creation cause them to suffer a high degree of degradation of various types. There are lots of ancient documents which are badly degraded. It is very difficult to segment text from the document, as there is a v...
Improved Degraded Document images Using Phase Based Binarization
Image binarization plays a major role in document image binarization. Scanning and printing of documents can degrade their visibility, which makes them difficult to understand. This paper proposes a new technique that can binarize documents efficiently. In the proposed method, a phase-based binarization model for document images is proposed, as well as a post processing m...
Phase-Based Binarization of Ancient Document Images
The main defects present in historical documents are darkness, non-uniform clarification, bleed-through and faded characters. To remove these defects, a binarization method is used. In this paper, a phase-based binarization method is studied in which the phase of ancient document images is preserved. The method is divided into three steps: preprocessing, main binarization, and post processing. In prep...